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Abstract — Rate and diversity impose a fundamental trade- 
off in communications. This tradeoff was investigated for Inter- 
symbol Interference (ISI) channels in [4]. A different point of 
view was explored in [1] where high-rate codes were designed 
so that they have a high-diversity code embedded within 
them. Such diversity embedded codes were investigated for flat 
fading channels and in this paper we explore its application 
to ISI channels. In particular, we investigate the rate tuples 
achievable for diversity embedded codes for scalar ISI channels 
through particular coding strategies. The main result of this 
paper is that the diversity multiplexing tradeoff for fading 
ISI channels is indeed successively refinable. This implies that 
for fading single input single output (SISO) ISI channels one 
can embed a high diversity code within a high rate code 
without any performance loss (asymptotically). This is related 
to a deterministic structural observation about the asymptotic 
behavior of frequency response of channel with respect to fading 
strength of time domain taps. 

I. Introduction 

There exists a fundamental tradeoff between diversity 
(error probability) and multiplexing (rate). This tradeoff was 
characterized in the high SNR regime for fiat fading chan- 
nels with multiple transmit and multiple receive antennas 
(MIMO) [6]. This characterization was done in terms of mul- 
tiplexing rate which captured the rate-growth (with SNR) 
and diversity order which represented reliability (at high 
SNR). This diversity multiplexing (D-M) tradeoff has been 
extended to several cases including fading ISI channels [4], 
[5]. The presence of ISI gives significant improvement of the 
diversity order. In fact, for the SISO case the improvement 
was equivalent to having multiple receive antennas equal to 
the number of ISI taps [4]. 

A different perspective for opportunistic communication 
was presented in [1], [2]. A strategy that combined high rate 
communications with high reliability (diversity) was investi- 
gated. Clearly, the overall code will still be governed by the 
rate-reliability tradeoff, but the idea was to ensure the high 
reliability (diversity) of at least part of the total information. 
These are called diversity-embedded codes [1], [2]. In [2] 
it was shown that when we have one degree of freedom 
(one transmit many receive or one receive many transmit 
antennas) the D-M tradeoff was successively refinable. That 
is, the high priority scheme (with higher diversity order) can 
attain the optimal diversity-multiplexing (D-M) performance 
as if the low priority stream was absent. However, the low 
priority scheme (with lower diversity order) attains the same 
D-M performance as that of the aggregate rate of the two 
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streams. When there is more than one degree of freedom 
(for example, parallel fading channels) such a successive 
refinement property does not hold [3]. 

In this paper we investigate the diversity embedded codes 
for an ISI channel with single transmit and receive antenna. 
Since the Fourier basis is the eigenbasis for linear time 
invariant channels we can decompose the transmission into 
a set of parallel channels. Since it is known that the D- 
M tradeoff for parallel fading channels is not successively 
refinable [3], it is tempting to expect the same for fading 
ISI channels. However, the main result of this paper is that 
for SISO fading ISI channels the D-M tradeoff is indeed 
successively refinable. The correlations of the fading across 
the parallel channels seem to cause the difference in the 
behavior. The structural observations in lemma [3] give insight 
into these correlations. This will be made more precise in the 
paper. 

The paper is organized as follows. In Section 2 we 
formulate the problem statement and present the notation. 
Section 3 gives a variation of the proof in [4] of the D-M 
tradeoff for ISI channels which makes a connection to the 
diversity embedded codes. We explore the role of correlation 
in successive refinement through a specific example. Section 
4 presents the statement and the proof for the successive re- 
finability of the D-M tradeoff for ISI channels. We conclude 
the paper with a brief discussion followed by the details of 
the proofs in the appendix. 

II. Problem Statement 

Consider communication over a quasi static fading channel 
with Inter-symbol Interference (ISI) 

y[n] = hox[n] + hix[n — 1] + . . . + h v x[n — v\ + z[n] (1) 

The v + 1 i.i.d. fading coefficients are hi ~ CAf(0, 1) and 
fixed for the duration of the block length (N + v). The 
additive noise z[n] is i.i.d. circularly symmetric Gaussian 
with unit variance. As is standard in these problems, we 
assume perfect channel knowledge only at the receiver. 

The coding scheme is limited to one quasi-static trans- 
mission block of size N + v. Consider a sequence of coding 
schemes with transmission rate as a function of SNR given 
by R(SNR) and an average error probability of decoding 
P e (SNR). Analogous to [6] we define the multiplexing rate 
r and the diversity order d as follows, 

log P e (SNR) R(SNR) 



d = lim 

SNR^oo 



log(SNR) 



SNR^oo log(SNR) ' 



(2) 

With these definitions, the D-M tradeoff for ISI channels 
was established in [4]. 



Theorem 1: [4] The diversity multiplexing tradeoff for the 
system model in (Q]i is bounded by, 

N - 



(" + 1)1- 



N 



r) <d isi (r) < (p + l)(l-r) (3) 



In this paper we explore the performance of diversity em- 
bedded codes over ISI channels. For clarity we focus on two 
streams but the procedure can be generalized to more than 
two levels. 

Let Tt denote the message set from the first information 
stream and C denote that from the second information 
stream. The rates for the two message sets as a function 
of SNR are, respectively, R H (SNR) and R L (SNR). The 
decoder jointly decodes the two message sets and we can 
define two error probabilities, P^(SNR) and P^(SNR), 
which denote the average error probabilities for message 
sets H and C respectively. We want to characterize the tuple 
(rji, dH,rL,d,L) of rates and diversities for the ISI channel 
that are achievable, where analogous to (fJJ, 

log P e H (SNR) .. Rh(SNR) 
dff = lim ^— ^ th = lim ; 

SNR^oo log(SNR) ' SNR^oo log(SNR) 

log P^(SNR) 
^ = 3^- log(SNR) > TL 



Rl(SNR) 
SNR->oo log(SNR) 



Also, we assume that dn > d^. Note that for the joint 
codebook {H, £} the total multiplexing rate is rjj + Tl 
and the diversity d = min(dH,dL) = d^. We use the 
special symbol = to denote exponential equality i.e., we write 
f(SNR) = SNR b to denote 

lo S f(SNR) =b 
SNR^oo log(SNR) 

and < and > are defined similarly. 

From an information-theoretic point of view [2] focused 
on the case when there is one degree of freedom {i.e., 
min(Mt,M r ) = 1). In that case if we consider dn > d^ 
without loss of generality, the following result was estab- 
lished in [2]. 

Theorem 2: When mm(M t , M r ) = 1, then the diversity- 
multiplexing trade-off curve is successively refinable, i.e., for 
any multiplexing rates vn and Tl such that rn + Tl < 1, 
the diversity orders d# > c?l, 



d H = d° pt (r H ), 
d L =d opt {r H +r L ) 



(4) 
(5) 



III. ISI Tradeoff 

In this section we give an alternative interpretation of 
the D-M tradeoff for ISI channels for the particular case 
of two taps i.e., v = 1. This exercise will help us to see 
the difference between the fading ISI channel and the i.i.d. 
parallel fading channel models. 

Rewriting the equation (HJ for the case of two taps, we 
have, 

y[n] = hox[n] + h\x\n — 1] + z[n] (6) 

Assume a scheme in which one data symbol is sent in every 
(v + 1) transmissions from a QAM constellation of size 
SNR r . With this strategy there is no interference between 
successive transmitted symbols and the receiver performs 
matched filtering to recover the symbol from the v + 1 
copies of the received signal. This gives us the matched filter 
upper bound to the diversity or the lower bound to the error 
probability, 



d isi (r) < 2(1 



r) 



where r is the multiplexing rate and d is the diversity order. 

For the lower bound to the diversity consider a transmis- 
sion strategy in which we assume that after transmission over 
a block length N, in the last v = 1 instants zero symbol 
is transmitted in order to avoid interblock interference. 
Therefore, the received vector over the block of length N + v 
can be written as, 



vM 
y[i\ 

y[N] 



h Q o 

hi h 

... 

... 



hi 



'•. 

hi ho 

hi ho 



x[0] 
x\l] 



x[N - 1] 




H 



(7) 



where z = [ z[0] z[l) ... z[N] ] T . Note now that the 
channel matrix H is a circulant matrix. Proceeding as in [4], 
look at the circulant matrix H = QAQ* in the frequency 
domain where Q and Q* are truncated DFT matrices and 



A is a diagonal matrix with the diagonal elements given 

hie «+i for I 



by A ; 



h 



0. 



,N. If (N + 1) 



(jv+i) 



sets of 2 



is divisible by two, we can view this as 
parallel independent channels. Communicating over each 



are achievable, where d opt (r] 
given in [6]. 



is the optimal diversity order such set of 2 parallel channels at a rate 2r, we get the 

j. The diversity for this 



N 
N+ 



Since the overall code has to still be governed by the rate- 
diversity trade-off given in [6], it is clear that the trivial 
outer bound to the problem is that dH < d opt (rH) and 
c?l < d opt (rH + Hence Theorem [2] shows that the best 
possible performance can be achieved. This means that for 
min(M f , M r ) = 1, we can design ideal opportunistic codes. 
This analysis was done for flat fading channels and we will 
show a similar theorem for ISI fading channels. 



effective multiplexing rate, f 

multiplexing rate is 2 — 2r = 2(1 — -^jp-r) as given in [6]. 

Note that in the above argument we do not utilize the 
fact that correlations exist across these sets of independent 
channels anywhere. In this case correlations did not matter 
since it achieve^ the matched filter upper bound. 

In order to illustrate the impact of correlation between the 
frequency domain coefficients, we will specifically consider 

'asymptotically in JV. 



the case for N = 3 and v = 1. Using (0 and the Fourier 
decomposition we see that we have four parallel channels, 

yi = Km + zi 1 = 0,..., 3 

In the spirit of the above parallel channel argument we 
can view these as two sets of parallel channels {ycbjb} 
and { 2/1,^3} each consisting of two sub-channels. Given 
this view since the D-M tradeoff for parallel channels are 
not successively refinable we would expect that such a 
characterization also hold for the ISI D-M tradeoff. However, 
since Ao = ho + hi, A2 = ho — hi, Ai = ho — jhi, A3 = 
ho+jhi, we see that the fading across the two sets of parallel 
channels are correlated. In particular, if \ho\ 2 > SNR~^~ r ' 

then asymptotically |A/| 2 < SNR~( 1 ~ r ' ) for at most one 
I € {0, ... ,3}. Therefore, it is possible to code across these 
sets of parallel channels to get better performance instead of 
treating them independently. 

This example gives the intuition to use the following 
method to prove the diversity multiplexing tradeoff for the 
ISI channel. Define a set A of events such that 

^={h:N 2 <^^aad N ^<-^} (8) 

For high SNR it follows that the probability of the set 
A occurring is P{A) = SNR- 2 ^"^ and P{A C ) = 1 - 
SNR -21 - 1 "^ . At each time instant independently transmit 

one symbol from a constellation with d? min > SNRy~ r > 
for A" time instants and pad it with v zero symbols. For 
detection, given that h 6 A c , we proceed as in the proof 
of lemma 3 with the detection of the N + v length trans- 
mitted sequence in the frequency domain. Clearly, the error 
probability of the scheme with this decoder, denoted by 
P® (SNR), is an upper bound to the error probability i.e., 
P e (SNR) < P°(SNR). Therefore, we can write, 

P e (SNR) = P(A)P e (SNR I h e A) + 
P(A c )P e (SNR I h e A c ) 

< P(A) + (l - SNR- 2(1 - r ^ P e (SNR I h e A c ) 

< SNR- 2{1 - r) + (l - SNR- 2(1 - r ^ P®(SNR | h e A c ) 
= SNR- 2{1 ~ r \ 

The last equality is true due to lemma [4] (given in Section 4) 
which states that if h £ A c the probability of error decays 
exponentially in SNR. This lemma in turn is based on a 
structural observation made in lemma [3] (also see Section 
3) that at most v coefficients in frequency domain will be 
smaller than mim |/i;| 2 and here given that h 6 A c at most v 
coefficients will be smaller than SNR~( 1 ~ r >. This method 
of analysis turns out to be more useful for us and takes 
into account the fact that the sets of parallel channels are 
correlated. 

This example was specifically for N = 3 and 2 taps. A 
similar analysis carries over for the case of general N and 
v. As summarized in lemma 2 in Section 4, for a finite N 
with (y + 1) taps either all the taps will be of order less than 



SNR r ' or at most v taps will be of order less than 

SNR-^- r \ 

IV. Successive Refinement of the ISI D-M 

TRADEOFF 

In this section we will formally prove the successive 
refinement of the D-M tradeoff for ISI channels. The intuition 
of the effect of fading in the frequency domain is captured 
by the following result which is proved in the appendix. 

Lemma 3: For a (v + 1) tap ISI channel we have the taps 
in the frequency domain are given by, 

A k =J2 h m e~T^ km k = {0, . . . , (N + v - 1)} 

171=0 

Define the sets T, Q and A as, 

T= {1 : |A,| 2 < SNR-V-^}, (9) 

g = {i:\Ai\ 2 = max |^| 2 }, 
ie{o,i,. ..,1/} 

A = jh : l^l 2 < j^~ r Vm e {0, 1, . . . , v} J (10) 
With these definitions we have: 

(a) ) Given that h £ A c , \T\ < v, i.e., at most v taps in the 

frequency domain are (asymptotically) of magnitude 
less than SNR-^~ r \ 

(b) ) \G C \ < v i.e., at least A^ taps of the N + v taps in the 

frequency domain are (asymptotically) of magnitude 
max(|/i | 2 , \h 1 \ 2 1 |/i„| 2 ), 

|{fc : |A fc | 2 < max(|/H>| 2 , \hi\ 2 , \K\ 2 )}\ < v. 

I 

Note that (b) along with h £ A c implies (a) and therefore is 
the stronger claim. Here is an intuition of why such a result 
will hold. Consider the polynomial 

A(z) = h m z m , 
m=0 

which evaluates to the Fourier transform for z = e . 
Hence, if we evaluate the polynomial at z = e , for 

k = {0, . . . , (N + v — 1)}, at most v values can be zero and 
at least A^ values are bounded away from zero. Therefore, if 
SNR is large enough, it is clear that at least A^ values would 
be "larger" than SN 'R~( 1 ~ r \ The details of the proof are in 
the appendix. 

Now consider transmission using uncoded QAM such 
that the minimum distance between any two points in the 

constellation d m i n is such that iS„ lin > SNR^ 1 ~ r K Defining 

T = {i : |A;| 2 < SNR-t 1 -^} we have from the lemma 
above that \T\ < v. Ignore these v channels and examine the 
remaining A^ channels in T c . We can show that the distance 
between codewords in these channels is still asymptotically 
larger than SNR^~ r > . Since the pairwise error probability is 
a Q function, we can show that the error probability decays 
exponentially in SNR. This is summarized in the following 
lemma, the proof of which is in the appendix. 



Lemma 4: Assume that the minimum distance cL 



be- 



tween any two points in the constellation (X) from which the 

signal is transmitted is df nin > SNR( 1 ~ r >. Assume uncoded 
transmission such that at each time instant one symbol is 
independently transmitted from the constellation for N time 
instants followed by a padding with v zero symbols. For a 
finite period of communication (finite N) given that h G A c 
(see (7)), the error probability P e decays exponentially in 
SNR. I 

The part (a) of lemma [3] and lemma [4] can be com- 
bined together to give an alternative proof of the diversity 
multiplexing tradeoff of the ISI channel. But to prove the 
successive refinement of the D-M tradeoff of the ISI channel 
we need the stronger result in the part (b) of lemma [3] 

We will prove a lemma analogous to lemma |4] for the case 
of superposition coding. This will be useful in our proof to 
show the successive refinement of the D-M tradeoff for ISI 
channels. Since we are padding every N symbols with v 
zeros, to communicate at an effective rate of r using uncoded 
QAM transmission, we need to send symbols from a QAM 
constellation of size SNR r where r = y~ ■ Let Xh 
be QAM constellation instant of size SNR TH and power 
constraint SNR. Similarly let Xl be a QAM constellation of 
size SNR Tl and power constraint SNR 1 " 13 , where j3 > ?h- 
As before, define 



A 



H 



h : \h m \ < 



1 



SNR 1 -^" 



Vme {0,l,...,i/} 



(ID 



Lemma 5: Using the Xh and Xl for signaling, assume 
uncoded superposition transmission such that at each time 
instant symbols are independently chosen and superposed 
from each constellation (Xh, Xl) for N time instants fol- 
lowed by a padding with v zero symbols. For a finite period 
of communication (finite N) given that h G A C H (see (fTTTi). 
the error probability of detecting the set of symbols sent from 
the higher constellation {Xh) denoted by (SNR) decays 
exponentially in SNR. 

In this lemma we critically use the fact that all except at 
most v taps in the frequency domain, are asymptotically of 
equal magnitude (max/gmi v \ |^;| 2 )- 

Using these lemmas we will prove the following theorem 
on the successive refinement. 

Theorem 6: Consider a v tap point to point SISO ISI 
channel. The diversity multiplexing tradeoff for this channel 
is successively re finable, i.e., for any multiplexing gains rjj 
and Tl such that rji + Tl < T^n; t ^ le achievable diversity 
orders given by dn(rn) and di(ri,) are bounded as, 

( 1 -^ lr ^) < d H{r H ) 

<{v + l)(l-r H ), (12) 

iv + 1) (l - ^^^h + r L ) \ < d L (r L ) 



< (v + 1) (1 - (r H + r L )) 



where N is finite and does not grow with SNR. 

Proof: To show the successive refinement we use 
superposition coding and assume two streams with uncoded 
QAM codebooks for each stream, as in [2]. Assume that 
given a total power constraint P we allocate powers Ph and 
Pl to the high and low priority streams respectively. We 
design the power allocation such that at high signal to noise 
ratio, we have SNR H = SNR and SNR L = SNR 1 -? 
for (3 <E [0, 1]. Let X H be QAM constellation instant of 
size SNR fH with minimum distance (d^ ln ) 2 = SNR 1 -^". 
Similarly let X L be a QAM constellation of size SNR TL 
with minimum distance (d^ nin ) 2 = 5A_R 1_,3_ri , where 
/3 > fa. The symbol transmitted at the k th instant is the 
superposition of a symbol from Xh, Xl given by, 



xh\ 



xl\ 



where xn[k] € Xh, xi,[k] £ Xl 



It can be shown [2] that even with the above superposition 
coding, if j3 > ?h the order of magnitude of the effective 
minimum distance between two points in the constellation 
Xh is preserved. 

The upper bound in both ( fT2l and ( TT3l ) is trivial and 
follows from the matched filter bound. We will investigate 
the lower bound in (fT2l . At each time instant superpose 
symbols from the higher and lower layers for time instants 
and pad them with v zero symbols at the end. We consider 
this particular transmission scheme and for detection, given 
that h e A C H , (where Ah is as defined in equation (TTTI) ). we 
proceed as in lemma 3. Therefore, we can write, 

P?{SNR) = P{A H )Pe(SNR | h e A H )+ 

P(A H )Pe(SNR | h e A H ) (14) 

< P(A H ) + (l - SNR-^+Vt 1 -^^ P e (SNR | h e A C H ) 

< SNR-( l,+1 K 1 -^+ 
\ - SNR-^ +1 ^ 1 ~ fH ^ P® (SNR | h e A C H ) (15) 

for communication at an effective rate of th = j^t^^h- 

For decoding the higher layer we treat the signal on the 
lower layer as noise. Given that h 6 A C H and choosing (3 > 
fn we conclude from lemma [5] that the second term in (TT~5T > 
decays exponentially in SNR. Therefore, 



P^(SNR) < 



1 



Or equivalently, 



("+1) 1- 



N ■ 



N 



th < d H (rH) 



(16) 



(17) 



Once we have decoded the upper layer we subtract its 
contribution from the lower layer. Proceeding as above, 
define 



A L = { h : \h 



< 



1 



SNR 1 -^-? 



Vm £ {0, !,...,!/} 



(13) 



(18) 



y[o] 
y[i] 

y[N] 

y[N + v] 



h 
hi h 








K 




H 



h 2 hi 

... h 2 



hi h 



x[0] 
x[l] 

x[N - 1] 

0„xl 

x 





z[0] 
z[l] 

z[N] 

z[N + v\ 



(22) 



For high SNR it follows that, 

P(A L ) = SNR-^+^-^-V, 
P(A C L ) = 1 - SNR- {u+1){1 - fL - p \ 



,1-/3 



For the lower layer we have that (d^ in ) 2 = s ^H t f L — 
SN R 1 ~ l3 ~ rL . Using lemma @] taking (3 arbitrarily close to 
?h we can conclude that, 

, ( N- 



N 



(r H + r L )j < d L (rL) 

< {u + l){l-(r H +r L )) 



(19) 

Comparing this with Theorem[T]we can see that the diversity 
multiplexing tradeoff for the ISI channel is successively 
refinable since d H (r H ) = d isi (r H ) and d L (r L ) = d isi (r H + 

r L ). ■ 
The intuition that was used in deriving the successive 
refinement of the SISO tradeoff for ISI channels was that 
given that h g A at most v taps in the frequency domain 
are zero and the remaining are "good" and of the same 
magnitude. This intuition can also be carried over to show 
the successive refinability of the SIMO channel with M r 
receive antennas and one transmit antenna. In this case, the 
received vector at the n th instant is given by, 

y[n] = hox[n} + hix[n — 1] + . . . + h^x[n — v] + z[n] (20) 

where y, h^,z G C A/ ' xl . Assume that the v + 1 fading 
coefficients are hj ~ CM(0, Im f ) and fixed for the duration 
of the block length (N + v) and h^ is independent of 
hj. Let h!f^ represent the i th tap coefficient between the 
transmitter and the p th receive antenna. We will denote 
C = circ{c\, C2, 
given by 



size SNR 1 ' . Therefore, the received vector over the block of 
length N + v can be written as in equation (l22b at the top of 
the page, where H G ^ N + V ) M ^ N ^\ y g C^ N +^ M - X \ 
z e C ( - N+ ^ AUxl and x g C Nxl . By reordering the 
rows we can write the received vector in terms of circulant 
matrices as, 



" y(D " 




r h(d i 


y(2) 






_ y(Mr) _ 







X 

0^x1 



+ 



z (i) 
Z P) 



(23) 



where H^), . . . ,H( M ^ g C^ N+l/)x ^ N+ ^ are circulant ma- 
trices given by, 



H (p) =circ{h { o p \0,...,0,hi p) 



forpg {1,2,..., M r } and, 



,(p) 



y W[l] 
y(p) [TV + v\ 



,c T } to be the T x T circulant matrix where y (p)[ n ] represents the symbol received at the p 



lh 



ci c 2 c 3 
c T ci c 2 

c 2 c 3 c 4 



CT-l 

ct- 2 



CT-l 
Cl 



(21) 



Consider a transmission scheme in which we transmit 
uncoded symbols from a QAM constellation of size SNR r 
for time instants and pad them with v zero symbols at 
the end. Consider a transmission scheme in which one data 
symbol is sent at every instant from a QAM constellation of 



receive antenna in the n th time instant. 

Since the H^ p ^ are circulant matrices we can write them 
using the frequency domain notation as = QA^Q* 

where Q, Q* g C^ N+V '> X ^ N+ ^ are truncated DFT matrices 
as defined earlier and are diagonal matrices with the 
elements given by, 



: km 



m=0 



for k = {0, .. ., (N + v — 1)}. Therefore the equation 
can be rewritten as, 



(2) 



y 

v( M 



H (2) 

H (A/ r ) 

QA^Q* 
QA( 2 )Q* 

QA( A/ -)Q 



x 

0„xl 



+ 



z (i) 

z (2) 



(24) 



X 

Oi/xl 



5 « 
,(2) 



QA< 1 )Q*x 






QA< 2 )Q*x 


+ 


z (2) 


Q A (A/.)Q* X _ 




_ z (M r ) _ 



(25) 



(26) 



where Q* is a (iV + v) x A matrix which is obtained by 
deleting the last v columns of the matrix Q* (similar as in 
the proof of lemma 0). 

Lemma 7: For a (i/ + 1) tap, M r receive antennas ISI 
channel we have the taps in the frequency domain are given 

by, 



4 P) = E ^ 



e (N + v) 



km 



771=0 



for k = {0, . ,.,(N + v- 1)} and p e {0, 
the sets and M as, 



= {k : |Ajf } | 2 < SNR-t 1 '^}, 
= {k : |A 



[ p) | 2 = max 
le{o,i,. 



|^| 2 } 



, M r }. Define 

(27) 
(28) 



X = {h: |^ p) | 2 < 



1 



v 2 e {o, . 



SNR 1 ^ 

f}> Vp e {l, 



,M r }} 



(29) 



With representing the complement of the set C?^, we 
have 

|0(p)| < v \/p 
and given that h G M c this means that, 

3pe {1,2,..., M r } s. t. \T {p) \ < v (30) 

I 

Proof: From lemma[3]for each p it is clear that \g(p) \ < 
v. Since h € M c there exists at least one (i,p) pair such 
that |/i- p) | 2 < SN Ri- r - Then from lemma [3] it follows that 
for this particular p, |jF' p )| < ^ and \G^\ < v. ■ 
As before, define 

1 



M H ^{h:\h^\ 2 < 



SNR 1 -^ 

Vie {0,1,. ..,!/}, Vpe{i,2,...,M r }} (31) 

Since all the tap coefficients are i.i.d we have that P{M) = 
SNR -M r (»+i)(i-r) and P (M H ) = SNR-^+W-r"). 



Lemma 8: Using the Xh and Xl defined earlier for sig- 
naling, assume uncoded superposition transmission such that 
at each time instant one symbol is independently transmitted 
from each constellation {Xh, Xl) for N time instants fol- 
lowed by a padding with v zero symbols. For a finite period 
of communication (finite N) given that h € M H (see (BTTl), 
the error probability of detecting the set of symbols sent from 
the higher constellation (Xh) denoted by P^(SNR) decays 
exponentially in SNR. 

We will just give an outline of the proof as the details are 
similar to the proof of lemma[5]given in the Appendix. From 
lemma [7] there exists at least one set of (N + l>) coefficients 
in the frequency domain through which Qx passes such 
that at most v taps of the available (N + v) taps in this 
set are of magnitude smaller than SNR^ ( - 1 ^ rH > . Then from 
lemma [5] it directly follows that the error probability decays 
exponentially in SNR. 

Theorem 9: Consider a v tap point to point SIMO ISI 
channel with M r receive antennas. The diversity multiplex- 
ing tradeoff for this channel is successively refinable, i.e., for 
any multiplexing gains rjj and rx such that th+tl < j^t^ 
the achievable diversity orders given by dnirn) and di(ri) 
are bounded as, 



<M r {v + l){l-r H ), (32) 

M r (y + 1) (l - ( r H + r L )^j < d L {r L ) 

< M r (v + !){!- {r H +r L )) 

(33) 

where N is finite and does not grow with SNR. 

Proof: As in theorem [6] use superposition coding and 
assume two streams with uncoded QAM codebooks Xh and 
Xl for the higher and lower priority streams respectively. 
Choose SNR H = SNR and SNR L = SNR 1 - 13 for f3 G 
[0,1]. Also let \X H \ = SNR fH and \X L = SNR fh . As in 
theorem[6]the minimum distances are (<i^j„) 2 = SN R 1 ~~ rH , 
(d^ m ) 2 = SNR 1 '?-^, and if > r H . the order of 
magnitude of the effective minimum distance between two 
points in the constellation Xjj is preserved. 

The symbol transmitted at the k th instant is the superpo- 
sition of a symbol from Xh, Xl given by, 

x[k] = xn[k] + XL[k] where xn[k] G Xh, XL[k] G Xl 



The upper bound in both (l32l and d33l is trivial and 
follows from the matched filter bound. We will investigate 
the lower bound in d32l . At each time instant superpose 
symbols from the higher and lower layers for N time instants 
and pad them with v zero symbols at the end. We consider 
this particular transmission scheme and for detection, given 
that h G M C H , (where M.h is as defined in equation OTTi). 



we proceed as in lemma 3. Therefore, we can write, 

P^(SNR) = P(M H )Pe(SNR | h G M H ) 
+ P{M c H )P e {SNR | h g M° H ) 

< SNR- M " ( - u+1 ^ 1 - fH) + (34) 

(l - SNR- Mr( -' /+1 ^ 1 - FH A P C D (SNR | h G M C H ). 

(35) 

for communication at an effective rate of r# = j^t^Th- 

From lemma [8] where we treat the signal on the lower 
layer as noise, we get that the second term in (l35T l decays 
exponentially in SNR. Therefore, 

P e H (SNR) < „, r „» f ,i lUl _ g „, (36) 



Or equivalently, 



M r {v + 1) (l - ^^r H ^j < d H {r H ) (37) 

Once we have decoded the upper layer we subtract its 
contribution from the lower layer. Proceeding as above, 
define 

Vi G {0, 1, . . . , v}, Vp G {1, 2, . . . , M r }} (38) 

For high SNR it follows that P(M L ) = 
SNR-v^+V^-tt-P). Using lemma taking (3 
arbitrarily close to fjj we can conclude that, 

M r (v + 1) (l - ^j^(r H + r L )^j < d L {r L ) 

< M r (v + !){!- {r H +r L )) 

(39) 

Comparing this with Theorem[T]we can see that the diversity 
multiplexing tradeoff for the ISI channel is successively 
refinable. ■ 

V. Discussion 

In this paper we presented the successive refinement of 
the diversity multiplexing tradeoff for the SISO ISI fading 
channel. Moreover we showed that superposition of two 
uncoded QAM constellations was sufficient to achieve this 
successive refinement. Although parallel channels are not 
successively refinable, a set of correlated parallel channels 
might be refinable. The same result holds for multiple 
receive and single transmit antenna. It would be interesting 
to investigate whether a similar result would be true for 
ISI channels with a single receive and multiple transmit 
antennas. 

VI. Appendix 

A. Proof of lemma \3\ 

Proof: The tap coefficients in the frequency domain are 
given by, 



Ak = E h * 



e («+") 



km 



k = {0,...,(N+v-l)} 



Defining = e ( JV +") the above equation can be rewritten 
as. 



m=Q 

= [1 6 



r ikrn 



k={0,...,(N + v-l)} (40) 



h h x 



Take any set of (V+l) coefficients in the frequency domain 
and index this set by JC = {k , . . . , fc„}. Define, 





' A fco " 




~ 1 


A = 


A fcl 




1 








1 



1 0* 



ho 
hi 



v 



h v 

h 



(41) 



where V G C^ +1 - )x ^ +1 ' ) is a full rank Vandermonde matrix. 
Therefore its inverse exists and we denote it by V -1 = A. 
Denoting the rows of A as, 



A = 



l( 0) 

,(D 



(42) 



we conclude that, 

a «V = and ^ (43) 

where e^^ G C lx ^ +1 ^ is the unit row vector with 1 at the I th 
position and zero otherwise. Note that the entries of {a^} 
do not depend on SNR. Therefore, 

a (0 = e (0 V -i (44) 

From fill we have, 

h = V _1 A 

Multiplying both sides by e' 1 ' and using fi4b we get, 
e«h = hi = eOv-'A ( =' a' 7 ) A = ^ af <A ki 

i=0 

Using the Cauchy-Schwartz inequality!!, we get> 



\hi\ 2 



iE^i 2 ^(Ei«" ) i 2 )(E!^ 



i=0 i=0 i=0 

Using the fact that is finite or does not grow with SNR 
it follows that the {af } do not depend on SNR. Therefore, 
the above inequality can be written as 



N 2 <|A fe0 | 2 



|A,J 2 



|A fc J 



(45) 



Note that the above inequality holds for all hi, I = 0, . . . , v. 
Therefore, we get that for any set of (y + 1) coefficients in 
the frequency domain indexed by {ko, . ■ ■ , k u }, 

max \hi\ 2 < |A feo | 2 + |A fcl | 2 + ... + |A fc J 2 (46) 

l£{0,l,. ..,!/} 

2 |u*v| < |[u|[.||v[|. 



From the Cauchy-Schwartz inequality note that, 

|A fco | 2 + |A fel | 2 + . . . + \k K | 2 = | ]T h m 6 k ° m \ 2 + ... 

v 

+ | £ h m k » m \ 2 

m=0 

V f V V \ 

<(E M a ) E i^° m i 2 + • • • + E i^ m i 2 

m=0 \m— m— / 

= (|^o| 2 + |^i| 2 + ... + |^| 2 ) 

= max |/i ; | 2 (47) 
ie{o,i,...,y} 

Combining equations d46i i and d4Tb we get, 

|AJ 2 + |A fcl | 2 + ... + |A fc J 2 = max |fc,| 2 . (48) 

ie{o,i,...,f} 

• Given that h £ „4 C we know that there exists at least one 
I such that \ht\ 2 > SNR-^~ r \ Therefore, if more than 
v taps in the frequency domain are of magnitude less 
than SN R~ { ~ v ~ r ^ choose our set K. to be these sets of 
coefficients. From equation d48l we get a contradiction. 
Therefore \T\ < v proving (a). 

• We know from d47l i that, 

|A fe | 2 < max \hi\ 2 Mk 
Je{o,i,. 

Since, Q = {i : lA.^ 2 = max /e{0il: ... il/} |/i;| 2 }, 

I A J 2 < max \hA 2 VkeG c . 
le{o,i,. ..,!/} 

If |5 C | > f then there exists a set JC = Q c of size at 
least v + 1 such that, 

|A fe | 2 < max \hi\ 2 Vfce/C 
le{o,i,. ...i/} 

But this is a contradiction to equation (08]) and therefore 
we have |C? C | < f proving (6). 

■ 

B. Proof of lemma [4] 

Proof: Consider the case where we have v + 1 taps, 

i.e., 

V 

y[n] = h m x[n — m] + z[n] 

m=0 

We receive a vector of length (N+v) denoted by y. Denoting 
the transmitted sequence of length N by x G X , the v zero 
symbols padded at the end by 0„ x i and the circulant channel 
matrix as H, we have 

y = H[ x 0„ xl ]* + z (49) 

Similar to analysis [4] we can write the circulant ma- 
trix H = QAQ* where Q G N+ ^ x{N+ ^ and Q* G 
c (N+v)x(N+v) ^ truncate( i OFT matrices and A G 



q(N+v)x{n+v) j s a diagonal matrix with the diagonal el- 
ements given by, 

Ak=J2 h m e-W^ km k = {0, . . . , (N + v - 1)} 

m— 

and the entries of Q are given by, 

(Q)p, = e ~TW+=T M for < p < (AT + v), < g < (N + v) 

Note that Q is a Vandermonde matrix. Multiplying the 
received vector by Q* we get, 

y = Q*y = AQ*[x uxl ]* + Qz = AQ*x + z 

where Q* G £.( N +») xN i s a matrix obtained by deleting the 
last v columns. Note that Q* is also a Vandermonde matrix 
which implies it has rank N. 

Note that from lemma [3] we know that at most v taps of 
the available (N + v) taps in the frequency domain can be of 

magnitude A^j 2 < SNR~( 1 ~ r >. Define a selection matrix 
S G C Nx( - N+ ^ such that, 

SAQ* = AQ 

where, A G C NxN , A = diag({Ai : I G T c }). Similarly 
Q G C NxN is the matrix Q with the v rows corresponding 
to {A; : I G T} deleted. Note that Q is still a full rank (rank 
N) Vandermonde matrix and denoting the singular values of 
AQ by 7j. we have > SNR~( 1 ~ r ' . Using this selection 
matrix we have, 

y = Sy = AQx + z (50) 

Due to the fact that we are using uncoded QAM for transmis- 
sion, the minimum norm distance between any two elements 
x/x'£ X N is lower bounded by, 

||x-x'|| 2 > SNR^~ r \ 

From the fact that Q is full rank its smallest singular value 
is nonzero and independent of SNR. Defining x = Qx we 
can conclude that, 

||* - x'|| 2 = ||x - x'|| 2 > SNR^ 1 -^ (51) 
As A is a diagonal matrix, 

JV-l N-l 

||A(x - x')|| 2 = J2 l A '(* - *')*l 2 = E l A 'l 2 K* - x ')'l 2 

2=0 (=o 

N-l 

= SNR-^-^ J2 K* ~ X ')'l 2 (52) 
1=0 

= SJVi2- (1 - r)+e ||(x-x / )|| a 

> SNR- {1 ~ r)+e SNR (1 - r *> = SNR £ 

where (T52t is true from lemma|3]for some e > 0. Since Q(x) 
is a decreasing function in x, using the above equation, we 
conclude that the pairwise error probability of detecting the 



sequence x' given that x was transmitted is upper bounded 

by, 

P e (x^x') < Q (||A(x-x')|| 2 ) < Q(SNR e ) 
Therefore, by the union bound we have, 

Pe(SNR) < SNR r Q (SNR e ) < SNR^'^ 21 

as Q(x) decays exponentially in x for large x i.e., Q(x) < 
e 2 . Hence it follows that given h G A c the error 
probability decays exponentially in SNR. Note that we only 
use the weaker form of lemma [3] over here i.e. we need at 
least N tap coefficients to be large but we don't need them 
to be of the same magnitude. ■ 

C. Proof of lemma [5] 

Proof: For decoding the higher layer we treat the signal 
on the lower layer as noise. Proceed as in the previous lemma 
(equation |50T > with the selection matrix S chosen such that 
A = diag ({A ( : I G £?}), where \Q\ > N. We get, 

y = Sy = A Qxh +A Qxl +z 

Xff XL 

= Axff + Axjj + z 



T mm = min I Ai| , 
Therefore, from lemma [3] we get 



F m ax = max | A, | 

ieg 



T mm = T max = max IhA 2 = SNR-^- p ^ +2e 



iG{0,l, 

where the last equality follows for some e > from lemma 

Has h G A° H . Since ||x L || 2 < SNR 1 '' 3 and from equation 
(fSTji in the previous lemma, we can lower bound SI as, 



n>r^ n ||(x ff -x ir )|[-2r^ aa ||x L | 



SNR- 



XL 



= SNR — +t (SNR—^~ - SNR— 

= SNR" 

where the last step is valid as (3 > ?~# Therefore, 

Pf (x fl ^x H ) < Q{SNR e ) 

which decays exponentially in SNR. By the union bound as 
in the previous lemma we conclude that P^ (SNR) decays 
exponentially in SNR. 



= Ax/f + z 

The decoding rule we use to decode x# is given by, 
x H = argmin ||y - AQx ff || 2 

Therefore, the pairwise error probability of detecting the 
sequence x H given that xh was transmitted is given by, 

F?(x H ^x H )= J2 ^(x£)P e (xH ^x^|A,x L ) 

= ^(x i )Pr(||y-Ax ff || 2 > ||y-Ax H || 2 ) 

xl ex" 

= Pr(xL)Q(\\M*H-*' H )\\ + 

< A(x H -x' H ), Ax L >\ 



2Re- 



\A(x H -&' H )\ 



(53) 



Note that Q(x) is a decreasing function in x. Therefore the 
equation d53l is upper bounded by, 

Pf (Xfl ^ x ff ) < 

Pr(* L )Q \\\H*h- x H )|| -2||Aii| 
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